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PubMed Eritrez BLAST OMIM Taxonomy . Structure 
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Match:|l j Mismatch:|-2" gap open:f5 ~~ gap extension: ' 
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Sequence Homo Sapiens mRNA for AIRE protein Length 2245 
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[g| Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION BLASTN 2.2.6 [Apr-09-2003] 

Match J Mismatch: ;2 j gap open:|5_ j gap extension: |2 ; 

xdropoff: [50 j expect:| 10.0001 wordsize: |_1 1 Filter F ffijjgj 



Sequence Homo Sapiens mRNA for AIRE protein Length 2245 

Sequence . 7 r 7 -, 81 yfl ld07.sl Soares fetal liver spleen 1NFLS Homo sapiens cDNA , 

2 gi _n/iM. c j one IMAGE: 126541 3', mRNA sequence Lengtn «o 

No significant similarity was found 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi70 
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NCBI 



PubMed 



Entrez 

Search | Nucleotide 

About Entrez > 



Search for Genes 
LocusLink provides 
curated information for 
human, fruit fly, mouse, 

rat, and zebrafish 




~B for |21ES212 ~~~ ~ 

W: Limits Preview/index History 

One of your terms is not found in the database. 
See Details . 
No items found. 
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Batch Entrez: Upload a 
file of Gl or accession 
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sequences 

Check sequence 
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Entrez 



•1 

PubMed 




Nucleotide 



Protein 



Search [Nucleotide gj for | 

Limits Preview/Index 
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Genome Structure PMC Taxonomy Boo 
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History Clipboard Details 
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T ; 1: Z97990. Homo Sapiens mRNA...[gi:2665370] 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REMARK 
FEATURES 

source 



source 



source 



CDS 



HSAPECED 2245 bp mRNA linear PRI 18-JUN-2000 

Homo Sapiens mRNA for AIRE protein. 

Z97990 

Z97990.1 GI :2665370 
Aire protein. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi / 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Aaltonen, J. , Bjrses, P. , Perheentupa, J. , Horelli -Kuitunen, N . , 
Paloti,A., Peltonen,L., Lee,Y.S., Francis, F., Hennig,S., Thiel,C, 
Lehrach , H . and Yaspo , M . L . 

An autoimmune disease, APECED , caused by mutations in a novel gene 
featuring two PHD-type zinc finger domains 
Nat. Genet. 17, 399-403 (1997) 
98061087 

2 (bases 1 to 2245) 
Yaspo, M. L . 
Direct Submission 

Submitted (21- JUL-1997 ) Max Planck Institut fur Molekulare Genetik, 
Ihnestrasse 73, Berlin D-14195, Germany- 
revised by submitter 24-SEP-1997 

Location/ Qualifiers 

1. .2245 

/organism="Homo sapiens" 
/ mo l_t ype = " mRNA " 
/db_xref ="taxon: 9606" 
/ chromosome= " 21" 
/tissue_type=" thymus " 
1. . 1810 

/organism= "Homo sapiens" 

/ m o 1 _ t yp e = " mRN A ' 1 

/ db_xr e f = " t axon : 9 6 0 6 » 

/ chromosome= "21" 

/clone="Bl-l" 

/ t issue_type= " thymus " 

/clone_lib=" lambda GT11 from Clontech (cat# hl5010B) " 
1811. .2180 

/organism="Homo sapiens" 

/ mo l_t yp e = " mRNA " 

/db_xref ="taxon: 9606" 

/ chromosome= " 21" 

/clone="Dl-l" 

/ 1 i s sue_type = " thymus " 

/clone_lib=" lambda GT11 from Clontech (cat* hlSOlOB) » 

121. .1758 

/codon_start=l 

/ product= "AIRE protein" 



http://\\ww.ncbi.nlm.nih.gov/entrez/vievver.fcgi?db=nucleotide&val=2665370 
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/protein id=" CAB10790 . 1 " 
/db_xref = "GI :2665371" 
/db_xref ="GOA:043918 " 
/db_xref ="Swiss-Prot : 043918 " 

/translation= M MATDAALRRLLRLHRTE I AVAVDS AFPLLHALADHDWPEDKFQ 
ETLHLKEKEGCPQAFHALLSWLLTQDSTAILDFWRVLFKDYNLERYGRLQPILDSFPK 
DVDLSQPRKGRKPPAVPKALVPPPRLPTKRKASEEARAAAPAALTPRGTASPGSQLKA 
KPPKKPESSAEQQRLPLGNGIQTMSASVQRAVAMSSGDVPGARGAVEGILIQQVFESG 
GSKKCIQVGGEFYTPSKFEDSGSGKNKARSSSGPKPLVRAKGAQGAAPGGGEARLGQQ* 
GSVPAPLALPSDPQLHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGT 
WRCSSCLQATVQEVQPRAEEPRPQEPPVETPLPPGLRSAGEEVRGPPGEPLAGMDTTL 
VYKHLPAPPSAAPLPGLDSSALHPLLCVGPEGQQNLAPGARCGVCGDGTDVLRCTHCA 
AAFHWRCHFPAGTSRPGTGLRCRSCSGDVTPAPVEGVLAPSPARLAPGPAKDDTASHE 
PALHRDDLESLLSEHTFDGILQWAIQSMARPAAPFPS" 



ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 

II 



cgggcgcaca 
gctgccagtg 
atggcgacgg 
gccgtggaca 
gacaagtttc 
gccctcctgt 
ctgttcaagg 
cccaaagatg 
gctttggtac 
gccgcgccag 
aagcccccca 
attcagacca 
ggagcccgag 
aagaagtgca 
agtgggaaga 
gcccagggcg 
gcccctctgg 
gtgtgtcggg 
gcctgcctgt 
ctgcaggcaa 
cccgtggaga 
cctggggaac 
ccttctgcag 
ggtcctgagg 
acggacgtgc 
gccggcacct 
ccagcccctg 
aaggatgaca 
agcgagcaca 
gcccccttcc 
agaaggacac 
cgccacctct 
catgtgcctg 
tccttgcctg 
taatcccagc 
tgcagtgagc 
ctcaaaaaca 
tcagtggcat 



gccggcgcgg 
tcccgggacc 
acgcggcgct 
gcgccttccc 
aggagacgct 
cctggctgct 
actacaacct 
tggacctcag 
cgccacccag 
cagccctgac 
agaagccgga 
tgtcagcttc 
gggccgtgga 
tccaggttgg 
acaaggcccg 
ctgcccccgg 
ccctccccag 
acggcgggga 
cccctccgct 
cagtccagga 
ccccgctccc 
ccctagccgg 
ccccgctgcc 
gtcagcagaa 
tgcggtgtac 
cccggcccgg 
tggagggggt 
ctgccagtca 
ccttcgatgg 
cctcctgacc 
ctccttcctc 
tgtcagtgct 
gaaattaaac 
gtgacctact 
tacatgggag 
tgagattgcg 
aaacaaaaaa 
tcacatctca 



aggccccaca 
caccgcgtcc 
acgccggctt 
actgctgcac 
tcatctgaag 
gacccaggac 
ggagcgctat 
ccagccccgg 
actccccacc 
tccaaggggc 
gagcagcgca 
agtccagaga 
ggggatcctc 
tggggagttc 
cagcagcagt 
tggaggtgag 
tgacccccag 
gctcatctgc 
ccgggagatc 
ggtgcagccc 
cccggggctt 
catggacacg 
agggctggac 
cctggctcct 
tcactgcgcc 
gacgggcctg 
gctggccccc 
cgagcccgct 
catcctgcag 
ccagatggcc 
agtcctggaa 
cggctgtaaa 
cctgccccac 
aaaaatataa 
cctgaggcat 
ccactgcact 
accacataac 
tgtaa 



gccccgccgg 
gccccagccc 
ctgaggctgc 
gcgctggctg 
gaaaaggagg 
tccacagcca 
ggccggctgc 
aaggggagga 
aagaggaagg 
accgccagcc 
gagcagcagc 
gctgtggcca 
atccagcagg 
tacactccca 
ggcccgaagc 
gctaggctgg 
ctccaccaga 
tgtgacggct 
cccagtggga 
cgggcagagg 
aggtcggcgg 
actcttgtct 
tcctcggccc 
ggtgcgcgtt 
gctgccttcc 
cgctgcagat 
agccccgccc 
ctgcacaggg 
tgggccatcc 
gggacatgca 
gccggccggc 
cagctctgtg 
ttctctactc 
aaattagctg 
gagaatcact 
ccagtctggt 
ataaatttat 



gacccgaggc 
cgggtccccg 
accgcacgga 
accacgacgt 
gctgccccca 
tcctggactt 
agcccatcct 
agcccccggc 
cctcagaaga 
caggctctca 
gccttccact 
tgtcctccgg 
tgtttgagtc 
gcaagttcga 
ctctggttcg 
gccagcaggg 
agaatgagga 
gccctcgggc 
cctggaggtg 
agccccggcc 
gagaggaggt 
acaagcacct 
tgcaccccct 
gcggggtgtg 
actggcgctg 
cctgctcagg 
gcctggcccc 
atgacctgga 
agagcatggc 
gctctgatga 
tgggatcaag 
tttctgggga 
tggaagtccc 
ggtgtggtgg 
tgaactcggg 
cggcaagagt 
catctcgacc 



caagcgaggg 
cgcccacccc 
gatcgcggtg 
ggtccccgag 
ggccttccac 
ctggagggtg 
ggacagcttc 
cgtccccaag 
ggctcgagct 
actgaaggcc 
cgggaacggg 
ggacgtcccg 
aggcggctcc 
agactccggc 
agccaaggga 
cagcgttccc 
cgagtgtgcc 
cttccacctg 
ctccagctgc 
ccaggagcca 
aagaggtcca 
gccggctccg 
actgtgtgtg 
cggagatggt 
ccacttccca 
agacgtgacc 
tgggcctgcc 
gtcccttctg 
ccgtccggcg 
gagagtgctg 
aaggggacag 
caccagccat 
cgggagcctc 
tgggtgcctg 
a ggtggaggt 
gagactccgt 
acttttcagt 



Disclaimer | Write to the Help Desk 
NCBI | NLM | NIH 



http://www.ncbi.nlm.rn 



2/19/2004 



NCBI Sequence Viewer 



Page 3 of 3 

Jan 29 2004 15:38:25 



http://www.ncbi.nlm.nih.go 



2/19/2004 



NCBI Sequence Viewer 



Page 1 of 2 




Nucleotide ! 



Entrez PubMed Nucleotide Protein 



Genome 



Structure 



PMC Taxonomy 



Boo 



Search | Nucleotide -J forf 




GoJ Clear | 



Limits Preview/Index 



History 



Clipboard 



Details 




File 




ni: R06810 . yflld07.rl Soares...[gi:757430] 



Links 



IDENTIFIERS 

dbEST Id: 

EST name : 
GenBank Acc : 
GenBank gi : 
GDB Id: 

CLONE INFO 

Clone Id: 
Insert length: 
DNA type : 

PRIMERS 

Sequencing : 
PolyA Tail : 

SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 

COMMENTS 



LIBRARY 

Lib Name : 
Organism : 
Sex: 
Organ : 

Develop, stage: 
Lab host: 
Vector : 
R. Site 1: 
R . Site 2 : 
Description : 



177106 

yflld07.rl 
R06810 
757430 
478702 



IMAGE: 126541 (5') 

1068 

CDNA 



M13RP1 
Unknown 



GGGAAATCCCAGGCTGCCGCGGCTCCTGTGTGCTTTGAAGGTAACTTTCCTTTGCTCTCG 
CGCTCCTTGGGCTCAGTGCCCTTTCAGTCTGATGAGATCTGGGCGGTTTTCTTCAAGCCT 
TCTCTGACAATTCCATCACTGTCCTCACCCTCTTCTTTTTTTTCTAGAACTTTGGTTAGC 
CAGATACAGACTCTCCTGGTTTGATTCTGTCTCTTTTCATTCTACATTCTAGAAGATACT 
CTTAACTTTATATTTCAGTCATGTTAATGAATGTCTTCTTTTAATGCCAGCAATTGCATT 
TTTAATGTCCAAGCTCTCTGTCTCATTAGTAGGCATTTTTTATTTCGTAAGTGCAGTATC 
ATCTTGGAATACTTCTGAAGACAAGTATTAACTTAAAAAAAATACCNTTCTGTTTNTGGA 
GCTAGCCCAGTTTCTTTNCAGCCT 

High quality sequence stops at base: 345 

Apr 3 1995 
Apr 3 1995 



Insert Size: 1068 

High qality sequence stops: 345 Source: IMAGE Consortium, 
LLNL This clone is available royalty- free through LLNL ; 
contact the IMAGE Consortium ( inf o@image . llnl . gov ) for 
further information. 



Soares fetal liver spleen 1NFLS 

Homo sapien s 

male 

Liver and Spleen 

20 week-post conception fetus 

DH10B (ampicillin resistant) 

pT7T3D (Pharmacia) with a modified polylinker 
Pac I 
Eco RI 

1st strand cDNA was primed with a Pac I - oligo(dT) primer 



http://www.ncbi.nlm. nih.gov/entrez/viewer.fcgi?db=nucleotide&val=757430 
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SUBMITTER 

Name : 

Institution : 
Address : 
Tel : 
Fax : 
E-mail : 

CITATIONS 

Title: 
Authors : 



Year: 
Status : 



MAP DATA 



[ 5 ' AACTGGAAGAATTAATTAAAGATCTTTTTTTTTTTTTTTTTTT 3 ' ] , 
double -stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Pac I and cloned into the Pac I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization. Library constructed by 
Bento Soares and M.Fatima Bonaldo. 



Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

314 286 1800 

314 286 1810 

est@watson . wustl . edu 



The WashU-Merck EST Project 

Hillier,L., Clark, N., Dubuque, T., Elliston, K. , Hawkins , M . , 
Holman,M., Hultman,M., Kucaba,T\, Le , M . , Lennon,G., Marra,M. 
, Parsons, J., Rifkin,L., Rohlfing,T., Soares, M., Tan,F., 
Trevaskis,E. , Waterston,R. , Williamson, A. , Wohldmann,P. , 
Wilson, R. 
1995 

Unpublished 
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Display default 
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Search | Nucleotide 3 forT 
Limits 



ill Show: 
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Preview/Index 



History 



Go | Clear 
Clipboard 
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Send to File 



l~; 1: R0676L yfl ld07.sl Soares...[gi:757381] 



Boo 

Details 
Links 



IDENTIFIERS 



dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi : 
GDB Id: 



177057 

yf lld07 . si 
R06761 
757381 
478702 



CLONE INFO 

Clone Id: 
Insert length : 
DNA type: 

PRIMERS 

Sequencing : 
PolyA Tail : 



IMAGE: 126541 

1068 

cDNA 



SP6 

Unknown 



(3') 



SEQUENCE 



Quality : 

Entry Created: 
Last Updated: 

COMMENTS 



CAGGACTTGAGGCTTTATCCTGAAAGAAGACGTAGGCAGTCACTTCCAATCATCTAACGA 

ATGAGCTGGTGGGGGGCAGGGGGCGGGCCTCCAGGGGGTCTTGAGAGCAGAGTCGCGCAC 

CCTGGATGGAAAGGCACCACAAGTGACTTTCTCGTCAACTAGGCAAGGTCTCCTAACAGC 

TGTTGTGCCACGGAGAGGCTGCTGTCCTGGAGGTCCTGCATTCTAAATCACAAAGTGAGG 

TGACAAGAGAAAGGATCCCTGCAGTGGGTTGGGGAGTGCGTTTTGGAGATTATCATCCTA 

GATCTACCAACACCGTGATCCTAAAGCCCCTGTGATAACACCAAGTCCAATTTACAGTGT 

CTTTTTCACGACAATTGCACCAAGGTCTTCAAGATTGCCCCAGTTGCCCCAGAGC 

High quality sequence stops at base: 346 

Apr 3 1995 
Apr 3 1995 



Insert Size: 1068 

High qality sequence stops: 346 Source: IMAGE Consortium, 
LLNL This clone is available royalty- free through LLNL ; 
contact the IMAGE Consortium ( inf o@image . llnl . gov ) for 
further information. 



LIBRARY 

Lib Name : 
Organism : 
Sex: 
Organ : 

Develop, stage: 
Lab host: 
Vector : 
R. Site 1: 
R . Site 2: 
Description : 



Soares fetal liver spleen 1NFLS 

Homo sapiens 

male 

Liver and Spleen 

20 week-post conception fetus 

DH10B {ampicillin resistant) 

pT7T3D (Pharmacia) with a modified polylinker 
Pac I 
Eco RI 

1st strand cDNA was primed with a Pac I - oligo(dT) primer 
[5' AACTGGAAGAATTAATTAAAGATCTTTTTTTTTTTTTTTTTTT 3'], 
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SUBMITTER 

Name : 

Institution: 
Address : 
Tel: 
Fax: 
E-mail : 

CITATIONS 

Title: 
Authors : 



Year: 
Status : 



MAP DATA 



double -stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Pac I and cloned into the Pac I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization. Library constructed by 
Bento Soares and M.Fatima Bonaldo. 



Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

314 286 1800 

314 286 1810 

est@watson . wustl . edu 



The WashU-Merck EST Project 

Hillier,L., Clark, N., Dubuque, T., Elliston,K., Hawkins, M., 
Holman,M., Hultman,M., Kucaba,T., Le,M., Lennon,G., Marra,M. 
, Parsons, J., Rifkin,L., Rohlfing,T., Soares, M., Tan,F., 
Trevaskis , E . , Waterston, R . , Williamson, A. , Wohldmann, P . , 
Wilson, R . 
1995 

Unpublished 
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Entree PubMecf Nucleotide Protein Genome Structure PMC Taxonomy Boo 

Search [Nucleotide 3 for | jGoj Clear | 

Limits Preview/Index History Clipboard Details 

Display | j default _ 2f Show: 1 20 Send to | [File jll Get Subsequence j Featui 

H 1: AB006684 . Homo sapiens APEC...[gi:2696618] Links 



LOCUS AB006684 20000 bp DNA linear PRI 14-APR-2000 

DEFINITION Homo sapiens APECED gene for AIRE-1, AIRE-2, AIRE-3, complete cds . 
ACCESSION AB006684 ' 
VERSION AB006684.1 GI:2696618 

KEYWORDS AIRE-1; AIRE-3; AIRE-2; autoimmune regulator-1; APECED; alternative 

splicing . 
SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (sites) 

AUTHORS Nagamine,K., Peterson,?., Scott, H.S., Kudoh,J., Minoshima , S . , 
Heino,M., Krohn, K. J. E . , Lalioti , M . D . , Mullis,P.E., 
Antonarakis , S . E . , Kawasaki, K., Asakawa,S., Ito,F. and Shimizu, N. 
TITLE Positional cloning of the APECED gene 

JOURNAL Nat. Genet. 17 (4), 393-398 (1997) 
MEDLINE 980610 86 
PUBMED 9398839 
REFERENCE 2 (bases 1 to 20000) 
AUTHORS Shimizu , N . 
TITLE Direct Submission 

JOURNAL Submitted ( 16 -AUG- 1997 ) Nobuyoshi Shimizu, Keio University School 
of Medicine, Department of Molecular Biology; 35 Shinanomachi , 
Shinjuku-ku, Tokyo 160-8582, Japan 

(E-mail : shimizu@dmb . med . keio .ac.jp, Tel:81-3-3351-2370(ex.2720), 
Fax: 81-3-3351-2370) 
FEATURES Location/Qualifiers 
source 1 . . 20000 

/organism="Homo sapiens" 

/mol_type= "genomic DNA" 

/ db_xr e f = " t axon : 9 6 0 6 " 

/ chromosome= "21" 

/map="21q22 .3" 

/clone="D4Gll" 

/cell_line="GM13 0B" 

/cell_type= n B-lymphoblastoid" 
mRNA join (4 5 80. . 4 83 8 , 52 57 . . 54 3 1 , 5 678 . . 583 3 , 6217 . . 6291 , 

7045.-7158,8357. .8502,868 8. .8768,97 95. .9910,110 02. .11101, 

11693 .. 1187 5, 124 89. . 12 610, 13100 . . 13 2 02 , 15082 . . 15144, 

16355. . 16688) 

/note="AIRE-l" 
exon 4580. .4838 

/number=l 
5 'UTR 4580. .4706 

gene 4707 . . 16426 

/gene= "APECED" 

CDS join (47 07. .4 83 8,52 57. .5431,5 678. .583 3,6217. .62 91, 

7045.-7158,8357. .8502,8 6 88. .8768,97 95. .9910,11002. .11101, 
116 93. .11875,124 89. .12 610,13100. .13 2 02,15082. .15144, 
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exon 



exon 



exon 



exon 



exon 



exon 



mRNA 



mRNA 



5 ' UTR 



exon 



exon 



CDS 



16355 . .16426) 
/gene="APECED" 

/note= "alternative splicing" 
/codon_start=l 
/product="AIRE-l" 
/protein_id= " BAA23990.1 " 
/db_xref="GI :2696619" 

/ 1 rans la t ion= » MATDAALRRLLRLHRTEIAVAVDSAFPLLHALADHDWPEDKFQ 

ETLHLKEKEGCPQAFHALLSWLLTQDSTAILDFWRVLFKDYNLERYGRLQPILDSFPK 

DVDLSQPRKGRKPPAVPKALVPPPRLPTKRKASEEARAAAPAALTPRGTASPGSQLKA 

KPPKKPESSAEQQRLPLGNGIQTMSASVQRAVAMSSGDVPGARGAVEGILIQQVFESG 

GSKKCIQVGGEFYTPSKFEDSGSGKNKARSSSGPKPLVRAKGAQGAAPGGGEARLGQQ 

GSVPAPLALPSDPQLHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGT 

WRCSSCLQATVQEVQPRAEEPRPQEPPVETPLPPGLRSAGEEVRGPPGEPLAGMDTTL 

VYKHLPAPPSAAPLPGLDSSALHPLLCVGPEGQQNLAPGARCGVCGDGTDVLRCTHCA 

AAFHWRCHFPAGTSRPGTGLRCRSCSGDVTPAPVEGVLAPSPARLAPGPAKDDTASHE 

PALHRDDLESLLSEHTFDGILQWAIQSMARPAAPFPS" 

5257 . . 5431 

/gene= "APECED" 

/ number=2 

5678 . .5833 

/gene= "APECED" 

/ number =3 

6217. .6291 

/gene= "APECED" 

/number =4 

7045. .7158 

/gene= "APECED" 

/number=5 

8357. .8502 

/gene= "APECED" 

/ number =6 

8688 . . 8768 

/gene= "APECED" 

/ number =7 

join (8 965. .9148,9455. .9910,11002. .11101,11693. .11875, 
124 89. .12 610,13100. .132 02,15082. .15144,163 55. .166 88) 
/note="AIRE-2" 

join (8965 . . 9148 , 9455 . . 9910, 11002 . . 11101 , 11693 . .11725, 

11808. .11875, 12489. .12 610, 13100. .13 202, 15082. .1514 4, 

16355 . . 16688) 

/note="AIRE-3" 

join (8965. . 9148 , 9455 . . 9506 ) 

/gene= "APECED" 

8965 . .9148 

/gene= "APECED" 

/ number =1 

9455 . . 9910 

/gene= "APECED" 

/number =8 

join (9507 ..9910, 11002. .111 01, 11693 .. 11875, 12489 . .12610, 

13100. .13202,15082. .15144,16355. .16426) 

/gene= "APECED" 

/note="alternative splicing 

autoimmune regulator- 2" 

/codon_start=l 

/product="AIRE-2" 

/protein_id= " BAA23991 . 1 " 

/db_xref ="GI :2696620 n 

/ translation="MWLVYSSGAPGTQQPARNRVFFPIGMAPGGVCWRPDGWGTGGQG 
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RISGPGSMGAGQRLGSSGTQRCCWGSCFGKEVALRRVLHPSPVCMGVSCLCQKNEDEC 
AVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQEVQPRAEEPRP 
QEPPVETPLPPGLRSAGEEVRGPPGEPLAGMDTTLVYKHLPAPPSAAPLPGLDSSALH 
PLLCVGPEGQQNLAPGARCGVCGDGTDVLRCTHCAAAFHWRCHFPAGTSRPGTGLRCR 
SCSGDVTPAPVEGVLAPSPARLAPGPAKDDTASHEPALHRDDLESLLSEHTFDGILQW 
AIQSMARPAAPFPS " 

CDS join (95 07 ..9910, 110 02. .11101, 116 93. .11725, 11 80 8.. 11875, 

124 89. .12 610, 13100.. 13 13 7) 
/gene= "APECED" 

/note= M alternative splicing" 
/codon_start=l 
/product= M AIRE-3 M 
/protein id= " BAA2 3 992.1 " 
/dbjxref ="GI : 2 6 96621" 

/ 1 r ans 1 a t ion= " MWLVYS SGAPGTQQPARNRVFFP I GMAPGGVCWRPDGWGTGGQG 

RISGPGSMGAGQRLGSSGTQRCCWGSCFGKEVALRRVLHPSPVCMGVSCLCQKNEDEC 

AVCRDGGEL I CCDGCPRAFHLACLS PPLRE I PSGTWRCS S CLQATVQE VQPRAEEPRP 

QEPPVETPLPPGLRSAGEEPRCQGWTPRPCTPYCVWVLRVSRTWLLVRVAGCAEMVRT 

CCGVLTAPLPSTGAATSQPAPPGPGRACAADPAQET" 
exon 9795 . . 9910 

/gene="APECED " 

/number =8 
exon 11002 . . 11101 

/gene= " APECED" 

/number =9 
exon 11693.. 11875 

/gene= M APECED" 

/number =10 
exon 11693 . . 11725 

/gene= "APECED" 

/note="10a" 

/number=10 
exon 11808. .11875 

/gene="APECED" 

/note= M 10b" 

/number=10 
exon 12489.. 12610 

/gene= "APECED" 

/ number=ll 
exon 13100.. 13202 

/gene=" APECED" 

/number=12 

3 ' UTR join (1313 8. .13202,15082. .15144,16355. .16688) 

exon 15082.. 15144 

/gene="APECED" 

/ number =13 
exon 16355.. 16688 

/ number =14 
3 'UTR 16427.. 16688 

ORIGIN 

1 tcaaaaaaaa gaagaaaaaa atctggcacc acttccccta gttcactgag aaaagtgacg 
61 ccgtctggac acatacccac agtgggacgc agtggagagg ccagaccagg gaaactgagg 
121 cccaaccagt ggggcaggtt tgaggcaaga gtagtaggag gggagggggc agggggtccc 
181 ccctccaatg cccttggctg agcacgggtt cttctgagct caccggtgct cggagctgcc 
241 tccctgactt ggacaaccat gtggaatggc catcccgccc tcaacaagta gaggcaactt 
301 cctctctaca gtgaggaaat tgaggcccag tgaggaagcc tgtggccagg agggtgctag 
361 gggtcctcac aaagtgtcct gaggggaccc acccctccct tgatgccaga ccacctggcc 
421 aggctgcagg agcagggccc agggcctcca ggagccaagg cctggccagg gctccatgtt 
481 ccccgaggcc tttctgactc aggctcctgg cctctgggag cctggccctt ccagggagcc 
541 acaggggatg gaacctccat ctggagcagg ggagaaggca tggagtccag gcttgtttga 
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601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
■ 2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 



ggggccaagg 
cccaacttct 
gacgtgacct 
ctgttcttcc 
tggccaagct 
gatgatgacc 
gcctagcccg 
aagggctgcg 
actgtcccag 
aggcatcagt 
ctgtgccacc 
ggaaagcccc 
ccatgcaaga 
gtgacccgca 
tgcagttttt 
catctgggga 
gaaggggtgg 
tggaaaagag 
cagcatacgg 
ggtgtttctc 
caggtaagca 
gctttagata 
ttgtcccacc 
gggttttata 
tgtctcaact 
ttacggtgtc 
ctatcacatg 
ccttccgcag 
cgagcatgct 
tttaaccctg 
agattaaaat 
tctttctttt 
ctggtcatcc 
actgtgcagc 
cgtgagaccc 
atcctgtgca 
tgtaaataca 
ctggggacag 
cttccctctc 
cagcgagagg 
tgttccaacc 
tcagctcccc 
tggccccacc 
tctcagacgt 
accgacccct 
aaaagcaaac 
tagcccccag 
ccctaatcca 
gagacctctc 
cagacaaggg 
tgcggggcca 
tttactgggg 
agctcccctc 
ccctcagcct 
ctcaggaccc 
agggggtcaa 
cgaggctcag 
tcaggagctc 
catcgggctc 
aggtccccca 



ctgagccccc 
atacacatca 
gggcacctga 
gtgcatcaag 
cctcccggca 
tgaggagacc 
gggcatggca 
gggccctccc 
agctctgctg 
cccactgaag 
tccagggagc 
acggcatggt 
ggctgggtga 
aagactgcag 
aattcccctt 
gggaaaagcc 
gttgcccctc 
acacagacaa 
aggacccccg 
ggagaggggg 
cgtgaacaaa 
tgtatacaca 
tccagcccta 
ccgagacatt 
gcaacgaggc 
gggctggggg 
gggagaaacc 
tgttttgtgt 
gccttcaagc 
agttgacaca 
ggagtctctt 
ccccacagcc 
caaagctgtg 
cagtgggccc 
ctgcccatgg 
aagctgggct 
gaactgcagg 
gatgtgtctg 
ggggtcttcg 
gccctgctcc 
ctgcactgcc 
atgacgtccc 
ttcctcccca 
gcagaacgct 
ggtctagagg 
ggcacgcctg 
cccagtgctg 
tgctggccag 
gtggttctct 
gactgttacc 
ccacagcctc 
gcactgtccc 
ccggacacct 
ctgtcactgg 
ccgttgtggg 
tgcgagcctc 
agaaggaaag 
cctggcaaca 
ctcccacagc 
ggaacaccac 



tccagcccct 
catgggccgc 
ggggcacccg 
gtccgagtcg 
acaggcctca 
gttctgctga 
ggtccgcagg 
gctccagccg 
cccaggtgcc 
acagggccga 
cgtggcggct 
ggaaagtccg 
cgcctgccca 
gtctttttta 
taaccatctc 
accgggttgc 
cacacctgtg 
agtatagaga 
ctggcctctg 
atgtggcagg 
ggtctctgca 
taaacatctc 
aggcagtttc 
ccattgccca 
gttccttcca 
acggtcaggt 
ttggacaata 
ccctgggtac 
atttgtttaa 
gcacatgtct 
atgtctactt 
tcaaaggaga 
tgcgctggtt 
aagggttgcc 
atgagcaagg 
tctgtgcaac 
ggagagggca 
ttggagcggg 
gaaggcgcag 
ccgctcaagg 
tgatactcca 
cagcacctga 
ggtttgttcc 
cagggggacc 
agaggtgtct 
cccacgtggg 
cctggattca 
gtcatctcgt 
gcagcctgca 
aaggcctccc 
agcatcattg 
cactcacgtc 
tcccagatgc 
tcctaggaag 
gctggccgcc 
aatccccagg 
gacttgcctg 
cccaggctgc 
cccaggcggg 
cccgatccag 



tccctgctct 
gtccccccac 
gcacagcagg 
tggttctgcc 
gtttccatat 
tggagtgagc 
cgtccaaggg 
aggacacggg 
tcttggggaa 
gtccccagcc 
ccatgtcacc 
aaattctaca 
caggggtcag 
aaaagtgaag 
tgggctaagg 
agggggtgac 
ggtgtttctc 
aagaaattgg 
agttccctta 
atcataggat 
tcataaacaa 
aatgccttaa 
cccctatctc 
gggacgggca 
cttttactaa 
ctttcccttc 
cctggctttc 
ttaagattag 
taaagcacac 
cagggagcac 
tctatgcaga 
ggaaagccaa 
ccctggcacc 
attgaacttg 
agaatggggg 
tttcgcttgc 
ggaaacccag 
gaccggcaag 
tgcagggagt 
ctcccaggac 
cagccaccac 
ccccaggctg 
caaacacccc 
ctgggcctga 
gccgctctcc 
aatcttggtg 
ggcttcctgg 
tctctggctc 
cacagggcct 
actgaggcca 
caggccccag 
cacctgggac 
ccccggaagc 
accctttggg 
ttccttctaa 
cgagtgtgcc 
gcgtcacaca 
cgtaccgtct 
cccctgagta 
cctccatgga 



gccattagga 
tttgcaggct 
agcatggacc 
ccttagcagc 
ctgtggatgg 
agaatcacaa 
cacagcacag 
gctcagggat 
actgaggctc 
cctcaccccc 
tgcgggcaag 
ggggcctctt 
gccccagccc 
tgcagacttg 
catccattga 
tttggggcag 
gttaggtaga 
gggaccaggg 
gtatttattg 
aatagtggag 
ggtaaagaat 
ggagcagtat 
agtagatgga 
ggagacagat 
tcctcctcag 
ccaggaggcc 
ctaggcaggg 
ggagtggtga 
cctgcacagc 
agggttgggg 
cacattaaca 
gttgctggga 
ccaggggctt 
atgttagtga 
tgtttcagga 
agttatttaa 
cgagcagcag 
gcaggcatgc 
gagagacgcc 
attccacaca 
tgagatggac 
accacctcgg 
tagagtcttt 
aggcggtagg 
atcctggact 
gcgtctctca 
taccttcgtc 
agtgggaaca 
gcttccctgg 
ccccccctcc 
gcctctgcac 
cctcggctcc 
tcctgtccag 
agctctcact 
agcacctgga 
ccttttaaag 
gctagcctaa 
cctgcatcag 
ggggctctca 
ggctcttgcc 



cctggctcct 
gagaacatct 
tggagaacct 
tgcgtgactc 
cactaatcac 
agctctcagg 
atgtgggaga 
ggagcccaat 
agtcccacaa 
ctccaggggt 
gggctggtgt 
tgttaaacct 
aatgacacag 
gatctcgctt 
ctcacttaac 
gatggcctct 
acgagagact 
gaccggcgct 
atcatttttg 
agaaggtcag 
taagtgctgt 
tgctgcctgc 
atatacaatc 
gccttcctct 
cacagaccct 
atatctcagg 
gtccctgcgg 
tgactcttaa 
ccttaatcca 
gtagggttac 
atctgatctc 
gcaacggtta 
ttatgcatga 
gctctccctt 
gcagacaggg 
atatttttgc 
cccggcctgg 
aggcaggggg 
caggcctggg 
ggagccagcc 
cccctccccc 
ccaccacctc 
tccggagcaa 
agaaaccgag 
ctggctgccc 
ctgcatcagt 
cacccttcag 
ctcgtgccca 
cttgaccgca 
aggccatgcc 
ctggtcttgt 
tgtccactgc 
gccacagcat 
cagggccaca 
ggaaggaagg 
atgagggaac 
gatggtgagg 
actgagcctc 
gcttgtgtgg 
ggccactggg 
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42 01 aggggccggt gcaccctggg cagcccctgc cagggccctg agacccgagc ctccccgccg 

4261 agggcacctg tctcggcttt gccccattcg agcagggccc tcgccgaggc aggacagggc 

4321 cacattcgga agtgagagtt ctctgagtcc cgcacagagc gagtctctgt ccccagcccc 

4381 caaggcagct gccctggtgg gtgagtcagg ccaggcccgg agacttcccg agagcgaggg 

4441 agggacagca gcgcctccat cacagggaag tgtccctgcg ggaggccctg gccctgattg 

4501 ggcgccgggg cggagcggcc tttgctcttt gcgtggtcgc gggggtataa cagcggcgcg 

4561 cgtggctcgc agaccgggga gacgggcggg cgcacagccg gcgcggaggc cccacagccc 

4621 cgccgggacc cgaggccaag cgaggggctg ccagtgtccc gggacccacc gcgtccgccc 

4681 cagccccggg tccccgcgcc caccccatgg cgacggacgc ggcgctacgc cggcttctga 

4741 ggctgcaccg cacggagatc gcggtggccg tggacagcgc cttcccactg ctgcacgcgc 

4801 tggctgacca cgacgtggtc cccgaggaca agtttcaggt gggctccccg cccgcccccc 

4861 gctgccccca ggccctgtga gccagggata gtccccgggg aagttccagg aggaccccgc 

4921 ccctccagat ccccaagccc ctccagcctt ccccaactcc ctccccacaa ggagccaggg 

4981 gcgtccctga tgacaagtta gaagttggtc cccttccccc agccgtcccc acacctcacc 

5041 cccaagccaa gggaatggcc tccaggttcc cccagcccca ccctcaacac ccctacacca 

5101 ccacctgact ccaccacaag ccgaggagat gggcgtggag ctgtccaggt cgccagcgcc 

5161 tctgcctggg agctccaccc tctagtcatg atggagatgg gcaggccgca gggtgtgggg 

5221 gaccatggca gggaccctca tgccacccca ctgcaggaga cgcttcatct gaaggaaaag 

5281 gagggctgcc cccaggcctt ccacgccctc ctgtcctggc tgctgaccca ggactccaca 

5341 gccatcctgg acttctggag ggtgctgttc aaggactaca acctggagcg ctatggccgg 

5401 ctgcagccca tcctggacag cttccccaaa ggtgggtcct ggtggactca gccatgctgg 

5461 gggcctgggg cagctgctgt cacctgctca gcccagctgg actggaaccg gagtggtgtt 

5521 tgaggagccc gtgggtgatg ttccaggacc gtcttggatc ctaagaggca aaggggccag 

5581 gcctcacctg tctggccaag gtgtccagtt ctggggccca ccctacccct ggagaaaacc 

5641 ctgaggttgg gaccctgctc ctgcccctga gctgcagatg tggacctcag ccagccccgg 

5701 aaggggagga agcccccggc cgtccccaag gctttggtac cgccacccag actccccacc 

5761 aagaggaagg cctcagaaga ggctcgagct gccgcgccag cagccctgac tccaaggggc 

5821 accgccagcc caggtaccct ccctgcaggg gaagccagcc agggtctcca gtcttcccgg 

5881 gcttccccgg gagcccacgc cccctcccca cccgggctcc cacccactgg gtgtggggcc 

5941 agcctgcctg gggctgtggg ggtctcctct gggtactaga cccacacact ggaccagcct 

6001 ctcagctccc tcctgcctga aggctgagct ccccggagct ggtgaagtag gcgggcgggt 

6061 ctcatttccc ttttactgat gagaaaccag agcccggcaa agggactacc cagcactgga 

6121 ccgccccctc cacgccctcc caccgcgggc ccctgcccac cggcactcac ccccactgag 

6181 aggggaggcc aggctgcccc cagctccccc attcaggctc tcaactgaag gccaagcccc 

6241 ccaagaagcc ggagagcagc gcagagcagc agcgccttcc actcgggaac ggtgagcggg 

6301 gcccagtggg agcgcctccc ttctccctgg ccaggggcaa ggggtcaggg gtcagagcag 

6361 ggcctgccct ctgagaccct gtcctagggg ctggggacgt gctggcctgg tgtgtcattc 

6421 caagggccta agctgcacca ccagacccag gaaggggaca ccttgggtct aagcatgatc 

6481 ttgccagtcg cccctgcccc cactgcaccc tggttctggg acccccttct caggcacctt 

6541 ctctgcccgt ccactcccta tccttcagga ccagcctaga catagcttcc tccagaaaat 

6601 catccctggc ccccagctgc atgcaggctg aacccttcct gtccccttct ccttccttcc 

6661 cagggcactg gactccagag accccctatc tccctgaggg cagagcctag gaactctgtg 

6721 tccctcccgg cacaatacag ggcccatgtc atgggggggt gggtctggtc attggtcatg 

6781 ccttcctatc cattgtgcca gctctgctga cactgccacc ccccagcaca cgcacacttg 

6841 ggtgcacaca cgaacacaca cattctcatg tctctgcact tacctgtggg ctgtctgcac 

6901 atggcagggc tgggtcccct ccttggcctg ccctggctgg aaggaaaggg ctctgcagcc 

6961 cagtgctgcc tgcttctggc atagagtatg tgcttgggaa cagtcttccc cacgggtgac 

7021 cccaatgggt gttccctttc ccagggattc agaccatgtc agcttcagtc cagagagctg 

7081 tggccatgtc ctccggggac gtcccgggag cccgaggggc cgtggagggg atcctcatcc 

7141 agcaggtgtt tgagtcaggt agacgctgtg gcggggagat ggggctgatg gggagaccca 

7201 ggctccaaga tggaaggagg accacgcccc tttgcatcct ggtggtccca cagcagaccg 

7261 gactgttgct caggtagcca gagtttctgc ctgtggttct gctgactttg gaggaggagg 

7321 gtgagcactg aagtctccct gtcgggggac cttctgcaag gccagcggtc caggcccaca 

7381 tccccacccg ggatgtacag cactccccag tcacctccat ccatgtgcat gggccctcct 

7441 gggccatggg gttgcatcct tagaaagttc tgcctgtgct gctgagaccc tccagggtat 

7501 cggcattctt caaccaggac agcctgtagc atagcgtcct tgccccccat accctggcca 

7561 gcctgcagca tcctcgcccg ccattccctg gccagccgct gaccccatgc aatcaccagt 

7621 gccatctgac cagggcacag cagggccgct ggtggcagac ccaccgtgcc atcggggcat 

7681 tccatctcaa gtccctgaca cggtgtctcc tcggtgctgg acatgggctg ggaacaccaa 

7741 gcacagccag ggccctggtc ttgcacctct ggatggtccc aaggcccact gtgttacttc 
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7801 
7861 
7921 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 
8881 
8941 
9001 
9061 
9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 
10621 
10681 
10741 
10801 
10861 
10921 
10981 
11041 
11101 
11161 
11221 
11281 
11341 



ctaaggctgt 
tcgaggtgtc 
ctcccagctg 
caccgctcca 
ggacaccagt 
aagaccctga 
gacgctgttg 
ggtgggggcg 
accctggggc 
cggcatctcc 
cccagcaagt 
aagcctctgg 
ctggggagcc 
gctctgctgg 
ctcgctgctg 
ctaggctggg 
tccaccaggt 
tgggtcctgc 
cctacccaca 
caatggtaat 
ggcgtttggt 
cctgctatag 
gcatggtttc 
tcttgaccac 
agtcctgccc 
aggtcattgc 
gtacctggga 
ggggctgcag 
ccccactgcc 
cgcagcagcc 
gttggagacc 
tgggagcagg 
ttgggaagga 
cttgcctgtg 
tctgctgtga 
agatccccag 
cctccaccca 
gggttctgag 
cctgcctccc 
gcaaagcaga 
cagcccacgc 
ccagacaggg 
ttgtccacca 
catcccggtg 
cgtggcagtc 
tctgccccag 
agaataaggg 
tctttctgcc 
cagtttaggg 
cgcaggtgtt 
ctggccaggg 
ctctgcccat 
ctggagctcc 
gactggtgga 
ccaggaggtg 
ggtatggcca 
cctaggctgg 
ccccagccat 
gagtccccat 
ggggggcact 



tggttaaatt 
tcggggccac 
ctggtcatta 
gcctctgctt 
cattgaactt 
tttcaagtaa 
aacaccctgg 
ggctggagga 
ctacacgact 
tcccaggcgg 
tcgaagactc 
ttcgagccaa 
tggctcttga 
gggctggggg 
aggctgcccc 
ccagcagggc 
aatgccctag 
tgcctctgcc 
gggtacagct 
agtttaaatg 
ccaggctgta 
ccaggaggtc 
ttaatggggt 
ttggcaccag 
tgcaggagca 
cgtgcaggac 
gacccctgaa 
gtggaggatt 
ctgtgaggaa 
tgcaagaaac 
agatggatgg 
gcagagactg 
ggtggctctc 
ccagaagaat 
cggctgccct 
gtgagcctgc 
ctgaccctga 
tcaggtcact 
ggtctggcca 
acagaggccc 
ccccatgggt 
ctggggagca 
atgcacagga 
ctccttcccc 
tgtgggagga 
cccctgagtg 
gcacagtggg 
cttgatcacc 
aggcccccgg 

ggggattcct 

ccaccccacg 
ctctctgctg 
acccgtgggt 
cacacgagca 
cagccccggg 
cgccccctcc 
gccaccccct 
agcactatgt 
catggttcct 
atgagcattg 



ggcacaaact 
actccctctg 
tgggggtacc 
ctgtggtctc 
atggtccagt 
ggtcacactc 
tgtagatcca 
atgcaggctg 
gccaaggcag 
ctccaagaag 
cggcagtggg 
gggagcccag 
tgccccccgc 
ctgctgccga 
cattgctgac 
agcgttcccg 
accacaggag 
tttacctggg 
ctttttcttt 
agtcagagaa 
cccgctgctc 
aaggatccac 
agaagcagtg 
gggctctgtg 
cccgggctgg 
cagcctagcc 
ggcacagcag 
cagcaggcgc 
gggttcatgt 
cgggttttct 
ggaacaggtg 
gggagttcag 
aggagggtgc 
gaggacgagt 
cgggccttcc 
acctctgcca 
agggaagcca 
gggccgtggg 
cacctgctgc 
agggcgaaga 
agccggcccc 
cagaggccac 
cgccgggctt 
acggcccacc 
agagctctgg 
gccgtcatca 
ggtcattgct 
tccccattct 
cagggcccag 
ggggttcatc 
aagggtaaat 
tgcctcggtt 
ttggggatct 
gtgggacctg 
cagaggagcc 
tagccgggcc 
cctgtccgtc 
cccccatgcc 
gtgggcctaa 
ataacggccc 



gggaggcttg 
gaggctccag 
cctgtgctcc 
acagctgtct 
gtgacctcat 
tgaggttctg 
ggacaatccc 
tgggaactcc 
gtcctgctgg 
tgcatccagg 
aagaacaagg 
ggcgctgccc 
cccaggaaca 
gagacgcctg 
gcccctcttc 
cccctctggc 
aggcccctgt 
cactcaggga 
aatagacagt 
agtgaggtct 
tcagctgggc 
tgggaatgcc 
tggggggtgc 
gggccctggc 
tgggcgtctg 
tggctgtctg 
gcaccatcca 
tgaggtcggg 
ggttggtgta 
tcccaatagg 
gtcagggcag 
gtacccagag 
tgcaccccag 
gtgccgtgtg 
acctggcctg 
gcgcaaccag 
ccccaagcct 
gccggggcct 
ccagcctgga 
tgccaccctg 
cacccccaag 
agagctgtgc 
agtgggggcg 
gagccctgcc 
gtgcagtggg 
ggccccctct 
cggctcctga 
gctgggtgcc 
ccctgagagg 
agagagcacg 
gtccccctgc 
ccccctctgt 
gtcacccgct 
gaggtgctcc 
ccggccccag 
acccctcctg 
tgtcccctgg 
caagcccggt 
acccagctct 
cggaagatgt 



aaatgacaga 
ggaagaatcc 
ttgttcctgg 
cccacgtgtc 
cttaactaat 
ggtggacgtg 
cgggccccag 
acctgtctct 
gcgggtgagc 
ttggcgggga 
cccgcagcag 
ccgtaagcac 
gcgttgcctc 
gtgccacagc 
cttgcagggt 
cctccccagt 
ctgcccttgc 
tgagcaccgg 
atttttttcc 
tctcaggctc 
ccgtgggtgg 
atgctcatct 
ctgccgtggt 
acttagcagt 
ggggattgtt 
gggggattct 
ggcagggcac 
agagacctcc 
cagttccggg 
gatggccccg 
aatttcaggc 
atgctgctgg 
cccagtctgc 
tcgggacggc 
cctgtcccct 
gccaccccgg 
ctcccatcca 
ggggttttcc 
cagctgggcc 
tccaagctca 
ccccacccca 
cccccagggc 
ggaggcctcc 
cccattccaa 
gacccacgtt 
cagccttgtg 
agccgttcct 
attcccctta 
caggcaaagc 
ccaaggggac 
tgggctctcc 
gaaaagacat 
gtcttgttct 
agctgcctgc 
gagccacccg 
tccacatggc 
agtcctgtgg 
ccttgtggtc 
cctggctgcg 
gttccttgtt 



aatgccaaca 
ttccttgtgt 
gctcaggacc 
ctctttataa 
cacatctaca 
aacttcgggg 
actcgactgg 
gctagacccc 
caggaccagc 
gttctacact 
cagtggcccg 
ctgaccttcc 
tgggggagtg 
catgtgcacc 
ggaggtgagg 
gacccccagc 
tcccctcggg 
ggcctgagcc 
tgataatacg 
ttaagagcat 
gccgggcgcc 
ttcgtcccca 
gggttacaga 
gacaggagcc 
agaatgagtg 
ggaggaagtg 
aaggacggtg 
ctgggcctgg 
gcccctggaa 

gggggtgtct 

cctggcagca 
gggagctgtt 
atgggcgtct 
ggggagctca 
ccgctccggg 
ttcacggccg 
agatggaaag 
caccctgcca 
cctgagggca 
tcccaggctg 
gagtcccact 
aggtgggagt 
tctgcgttca 
ccccacagga 
caggcgaggc 
cctcatcact 
ccttgccgtc 
acaggtgggt 
caccagggct 
cctgatcacg 
cttcctgtgt 
ggtcggagcc 
gcatgtctct 
aggcaacagt 
tggagacccc 
cacgccccct 
gacaggactg 
tcctgcagtg 
ggtccacccc 
ctgctgctgt 
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11401 gagggtagta ggtctactgt gcacagaccc agtgttccct ctgacagccc tgagggccag 
11461 ggggcccccc gtgtgtagac gggggaggag ggaggaccac agagccagga agtgccacag 
11521 cctttcccac tcagtgtgga cgccttccac catgccagcc ctccgccccc accatgccag 
11581 gcctctgccc ccaccctgct gccctgggtt tcagggtccc agcagtcact gactcctggg 
11641 tggtgccggg caggcgcccg ctgcccctct gatgctgacc cttgggttcc agctcccccc 
11701 ggggcttagg tcggcgggag aggaggtaag aggtccacct ggggaacccc tagccggcat 
11761 ggacacgact cttgtctaca agcacctgcc ggctccgcct tctgcagccc cgctgccagg 
11821 gctggactcc tcggccctgc accccctact gtgtgtgggt cctgagggtc agcaggtgag 
11881 cggggagtgg gggtcagggt gggctcttca aggagcccag gacctacggg gcggatgaat 
11941 tcacctgaaa caggaggaga gggaggccag gcgagaaagg ctccgggagg cacagggcct 

12 001 ggggctgtgg ggggagcgtg gggggctgcg gggggaaggg gacgctccta gacctccact 
12061 ccagctcctg gccctgggca ttactgctcc ccccacaagg caggacaatg aaggggggga 
12121 tgtcccagca cacgtgggag ccctcccctc cctgcctcaa ttcccttccc tgcacccctg 
12181 tgggcaccgc ctttcaggag actcccgcac tcagccccaa aggaggccag gcccgccaag 
12241 caggagagag gtgcgggcgc caggcttgca ggcagcagcc tgagggtgct tgggtcgccc 
12301 ctgcctcctg gggatgggac tggtcccgct gtcctgcagc ctgcgtggca ccgtgaggct 
12361 cctcacttgc gcctagaccc gccgtccagc cctgggtggt cccaggggag agcgcacagg 
12421 gctcgggttc gggttcagct acatttcccc cggccccccg cgtcaccccg cgctgttgcc 
12481 tcccacagaa cctggctcct ggtgcgcgtt gcggggtgtg cggagatggt acggacgtgc 
12541 tgcggtgtac tcactgcgcc gctgccttcc actggcgctg ccacttccca gccggcacct 
12601 cccggcccgg gtgagtgagc gtggtcggcg gggaggcctg aacccacacc cacaccctac 
12661 accccacccc acactcccca cccacatcat acagccacaa ccacacccca cccacacccc 
12721 acactcccac ccacaccttg caccccaccc cacacccatg ccctgcaccc acaccctaca 
12781 ctccacagcc acactccacc acacccccac ccacacccta ctccccacct cataccctgc 
12841 acctcaccac actccacagc cacaccccac cccacacccc acactcccac ccacacccta 
12901 cacccacccc acaccctaca cccaacccaa acccacccaa acccaccact cccactctcc 
12961 acccacaccc acaccccttc ctcacacccc acacccccat cccccactca ccacccacgc 
13021 ccacacccca caccccatac cccggaggtg gcactcctgc tcccccccag ggctggcagc 
13081 ccctcatcct ctgctgcagg acgggcctgc gctgcagatc ctgctcagga gacgtgaccc 
13141 cagcccctgt ggagggggtg ctggccccca gccccgcccg cctggcccct gggcctgcca 
13201 aggtcagtgc cgcaggggcc ctccatgcat gccggtgctg ggggtgggga accccttggg 
13261 ttggtgttgg gggagcacat ctcagggcag accctgggtg ccagcttcga gggcttgcac 
13321 cagacgcact gaccatgtgc tcattatctg tagaaaatat ttccccttta aaccaattct 
13381 ttttggcaac ttaaatatag ttaaaaagga agctcccccc gagggttggt ggctgacgtc 
13441 acggttggct gtgtggccgc ctcacagcat gagcctgaga gtcctgccag ggctccctgg 
13501 tggggtgaag ggagagcggg agcgcccggc ctgcaggagc aaacccccac cctgtctgac 
13561 ccctccaggt tgtctcaccc ccagccctcc ctggggccag gatccacccc actgtgtggc 
13621 cagagccctc tcagagaggc aaagtgaccc cgggtccagc cagtagctct tcctgtcctc 
13681 ctgctccggg gtcagagagg acctgggtgg cgcggagacc cctgactgct ggggcggctg 

13 741 ggcttgccct ggagctgggt gtgggggagg cccgagtcgc tgctgcagga gcctccgggg 
13 801 gggtggcctc ttgccctgac cgtccccagc agaggcctcc tgagcacatc ctggccaccg 
13861 aggagccttt agggatcctg gggtgatgac acgtcccacc tgctccactg gcccatgctc 
13921 tttcccagct gtgcctccgc cccgtataca ccgtgtgggt gacaggccac cccggcgtgg 
13981 tactccccag gagggtgaca gcctacccca gcgtggtact ccccgggcag gtgacaggct 
14041 tccccggcat gcaggctctg gcctggcatg gcacaagcct cagacccagc cctgcccttg 
14101 gggcttttgt ggaacagtgg cgtggcccac agctgtcact gtccccttcc ttctagaagc 
14161 ctccctcctc acaccaccca tctggagtca ggagcccagc cgggcatata cgcagatgcc 
14221 cctccctaac cccaggcagc tttcctgcaa ctgctcccgc agcgggtacc tcgtcattaa 
14281 cctcctgggt tctgtctctg aacagcagag acctctttct tgtcatcgtg atgtgaaatg 
14341 taacgccatg tcagaggaaa agttctggct ggccttggcc tcccccctca gcctgccccc 
14401 ttcctccagg gtggttggac gtggccccag accccatcct gagcagctct cccaccccct 
14461 gggagcatcc ttaggaccgg ggagcatcca ggggctttcc cctccagacc gggcagcccc 
14521 tccctcagcc atgcagggct gccgggcctc gcagcgccag tgttcacccg agtggaggag 
14581 ctgggatgtg gctgtttggg gccacaaatg gggaattcca cagggttcaa tgtaatatgg 
14641 tctcctctct gctgggggtg cctgcctggg gaccttctcc cactctggtc gctcacctat 
14701 agtgtgggct ggccctggtg gtgcttgtcg ggggcggggg tggcatggac caggcacttt 
14761 cctctctggg cctcagactt cccctctcag agtgggactc cttgctggtt ccctgagctc 
14821 cctcgttttc cccaggaggc cacacagtgt ggaggctgtc tgggggccgt gggcagctgg 
14881 ccgtgggcag gaccctgggg aggcagcccc agccccatca tgcccacgca gccctgtgcc 
14941 cccaccccca gtggagctgg gtgtaagaat tcccatctca gtgtggggga aacacccccg 
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15001 cggcccctag gccctgcggc ctctgtaccc ccaccagggc tgtgggagtt gggctgacct 

15061 cttctcttta ctgggttcca ggatgacact gccagtcacg agcccgctct gcacagggat 

15121 gacctggagt cccttctgag cgaggtaacg cctcccctgg cctcctggtg ctcctccact 

15181 ccccctcccc tgcctcagcc ggcacccagg ctccccactc tgggggagga ctgccggccc 

15241 ccactgctct tgagccgtgg aaactcaggc tgtccctgct ccacccacca ggagccccag 

15301 tgctgctgag cacctggcac cccccacagg agccccccta gcccccttgc aggagccccc 

15361 cccggcccct ccccctgcgg gagcccagtg ctgctgagcg cccccagccc ctccccaaca 

15421 agagccccca cacggcccct cccctgaggg cctgcaccct ggcaggcaga ggctcgagca 

15481 ccaggctcaa gatccacttt cccagggagg gtggggcgtg ggagtggggg gggggtccca 

15541 gaccccgtcc ctctaagatt tgcttgcccc tcccaactca ggcctctcta cgctaagatg 

15601 ggcaggtaga atctgtgggg aaaatgtgac ttttaagggc tctgtctgtt tttgccaaga 

15661 ggataagctc cttcagcctc cacgggttct cctcagtgtc tgatgtggca cccgggggtc 

15721 ccagctgacc atggggcagg ggttctgccc tgtgcagtgg ccgtgcccca cacaccctga 

15781 ccgtgcaggt gtctgcagag ccccagggcc tgagagtggg ccagggggcc cagcgctggg 

15841 taatggagct gcccctctgg atggggtccc cgggtatagc tggagaaatg agcgacgggc 

15901 tcacagcctc tcccgggtgg cggtcttatt ctgctggcat cgtggggccc gtggccccat 

15961 cctgtgggag catcaggctc ctgagcagaa taagtagctg gccccgaccc ccccaccctg 

16021 aaggagccac ccgaggaggc agaactgcca tgaactgcca tggggatgtg ccctgggctt 

16081 ataggatgtg gtgaagtaca caggacaggg tcctcggtct ggcctgtgcc atggggacct 

16141 tgggcctcag tttccccacc tttgatggaa tacggtgaag tgcacaggac agggtcctcc 

16201 ccagactggc ctgtgccatg gggcctcggg cctcagtttc cccacctttg acttagaggg 

16261 aaggttggat ggtgacttct tgtaacgatg gccatgattc tgtggctgcg gcgggggcgc 

16321 acctggaggt tctcaccgtc actctgtccc gcagcacacc ttcgatggca tcctgcagtg 

16381 ggccatccag agcatggccc gtccggcggc ccccttcccc tcctgacccc agatggccgg 

16441 gacatgcagc tctgatgaga gagtgctgag aaggacacct ccttcctcag tcctggaagc 

16501 cggccggctg ggatcaagaa ggggacagcg ccacctcttg tcagtgctcg gctgtaaaca 

16561 gctctgtgtt tctggggaca ccagccatca tgtgcctgga aattaaaccc tgccccactt 

16621 ctctactctg gaagtccccg ggagcctctc cttgcctggt gacctactaa aaatataaaa 

16681 attagctggg tgtggtggtg ggtgcctgta atcccagcta catgggagcc tgaggcatga 

16741 gaatcacttg aactcgggag gtggaggttg cagtgagctg agattgcgcc actgcactcc 

16801 agtctggtcg gcaagagtga gactccgtct caaaaacaaa acaaaacaaa aaaaccacat 

16861 aacataaatt tatcatctcg accacttttc agttcagtgg cattcacatc tcatgtaacc 

16921 attgccacca ccatctgcag agctgttcca gcttcctgaa tggaaactct ggccccatga 

16981 aacactcact ccccattccc ctcccaaccc ctggcaccct ccactctact ttctgtctct 

17041 atgacttctc cagggacctc atggaagtgg aatcacacag tatctgtcct tttgtgtctg 

17101 tcttatttta ctcagcatgg tggccccaag gttcatccat gttgtagcat gtgccgaaat 

17161 ctgcttcctt ttttaaggct gagtaatatt ccagcacatg gatggactgc attgtgttca 

17221 tctatgcttt cctcagcaga cacctgggtg cttccacctt ttggctgttg tgagtaatgc 

17281 tgctatgaat atgtgtgtgc aaatatctgt gtgagtcttt gagtaaaatc gctggatcac 

17341 atggtaatta tatttttaat tttttgaaga actgccatcc tgttttccac agcagctaca 

17401 ccatttcaca tccccaccaa cactgcatga gggttccagc ctccccatat ccttgaaaca 

17461 attactattt tctgcatttt tctttctttt cttttcttct ttttttgaga cggagtttcg 

17521 ctcttgctgc ccaggctgca gtgcaatggc gcagtctcca ctcactgcaa cctctgcctc 

17581 ctggcttcaa gtgattctcc tgcctcagcc tcctgcatag ctgggattac aggcgcacac 

17641 caccatgccc ggctaatttt tgtattttta gtagagacgg ggtttcacca tgttggctgg 

17701 gctggtctcg aactcctgac ctcagacctc aggtgatcca cccccccccc ccccccccac 

17761 cccacgggcc tcccaaagtt ctgggattat gggcgtgagc cactgcgccc agcctgtgtt 

17821 tttcttttat agtagccatc ctaatgggta tgaggtggta cctcattgtg gctaaaccat 

17881 gtacgtttaa gaatttacac acacacacac acacacacac acacacacgt gatgaagctg 

17 941 taagaaaagg gagagaataa tgacccaaaa tgtcagcaca aggatgcctc aggggtagac 

18001 acaggtgtct tcttagcgcc ttttagccca cgcatatgtg cttgcgcagt ctctcggtgt 

18061 gttttctaac atttcacaat taaataagga gtagttgata ttcacagcca ccatttccac 

18121 cgtatatgta gagaaaaatg ctcaaatcct agcgatcagc ccgctgagct ttcacaaacc 

18181 gaacacaccg tgcggtcacg aggcggctct cccgaacacc ggctctgcac ccccagggcc 

18241 cggggaacca agacttcccc aagggcccag caaaagtgag ggaggagggg aggaaaggag 

18301 ggagagggag ggaggagtgc tggctgcttt ccgccagcgc ctcgtttggg tttcgctctg 

18361 ctgaggcctg aaggcgccct gagggcccct ggggggcagg gaggggcggt ggctgcggcc 

18421 gacagcaagg aacctccgga gaagcgaagc cgggccagac gtgccaccgg ggcctcggcg 

18481 ctggcgaagg ggcgcgtgcg gccgcactgg ggccagcccc gccctggtcc ccggccacat 

18541 cctcgagacc accggggtgg aggggggggt ctcactgctc agggcccgcg tgacattgca 



http://www.ncbi.nlm.nih.gov/entrez/viewer. fcgi?db=Nucleotide&dopt=GenBank&val=269... 2/19/2004 



NCBI Sequence Viewer 



Page 9 of 9 



18601 
18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 
19261 
19321 
19381 
19441 
19501 
19561 
19621 
19681 
19741 
19801 
19861 
19921 
19981 



// 



caggtttaga 
ctccgtccag 
gaaggcgcgc 
tgggcccgtg 
ggcgcgggca 
tcccggccgc 
cccgccgagc 
cccggcctcc 
gcggccggcc 
gggtgtgtcc 
agcggagggc 
cggggacccg 
ggctgggcgg 
ccttccttgt 
gagcaaagcc 
actgggactt 
ccctcccgga 
cagtgtggca 
ggatttagga 
gctgtgcagc 
cacacacccc 
agaggcgaga 
gggtgtcact 
gcctgtggtg 



aacgcacagc 
ggtccgtcct 
gcggggcggg 
tttcggccgc 
aggccatcgg 
gtcgcggcgc 
cccgggaccc 
tgccccgggt 
acgttctcgg 
gactctgggc 
gccggtcccg 
gtgcctcccg 
gggcggactc 
gggtctagct 
cagttgtcac 
ggcagagctg 
ggcctggtcc 
cctgtggcac 
gaccagaagt 
ggggtctggt 
cacccagctt 
aggaagccgg 
gtctgaaccc 
gctgcccacc 



gggaaggtgg 
gcgaggaggc 
gcggggacgg 
cgccatggcc 
cgtcctgacc 
agggggtgga 
gggtctcggg 
ctgtcccccg 
cctccgcctg 
cgcggggcag 
ccacccccag 
agggccggcg 
actcccggac 
ggggaagagg 
ccttttcgca 
ggtgggggtc 
tgcccactgc 
aggacgccac 
gaggtgctgg 
ctgtctttta 
caggcacctg 
ccttgagtgg 
cggcctgcag 



ccccggcggg 
gccgggggcg 
cgacgcggcg 
gcggtggacc 
agcggcggcg 
gggcgcctcc 
ccggcccggc 
ctcttccccg 
ggtcttggcg 
gacctcccga 
agcctggccc 
ccaggctgcg 
cgcgcctcct 
gcggtgggga 
ggggtactgg 
cggcctgagg 
cggcctccag 
ctaccctacc 
gttgggaagg 
gagttcagaa 
accctccttt 
gaccaagcca 
agggggctcc 



ccgagggcgc 
ggcccggggc 
caggcggcgg 
tggagaagct 
acgcgcaagg 
gccctggcct 
ctcccgcccc 
gcgcggtctc 
ccgcccgcgc 
ggagtccccg 
ttcctggttc 
ggcttccggg 
cctcggggcc 
gatgggctgc 
ggaggcctca 
gcgtggtgtc 
tgtctctggc 
cctcaggagg 
ggcccagcca 
aatggttgag 
tccctatctg 
gggaggcagg 
ttcagacctg 



aggatccctg 
ggggcgggcg 
gagtgcgagc 
gcgggcgtcg 
tgggcggggg 
ctccggagcg 
ggcctcccgc 
cggcctacgt 
cctgccccgg 
aggccacgcg 
tgaagcgacc 
acctcctccc 
ctcgcggccg 
ccggggcatt 
ggtggtgggg 
ctcgcacgct 
ctgacatgtc 
agagaggaga 
gcaccccaag 
aaaatgcgga 
gaagccagga 
agtggctcgt 
gctgagccag 
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